PAGE: 1 RAW SEQUENCE LISTING DATE: 02/23/96

PATENT APPLICATION US/08/487,032 TIME: 11:34:47

INPUT SET: S8924.raw



This Raw Listing contains the General 
Information Section and those Sequences 
containing ERRORS. 




1 SEQUENCE LISTING 
2 

3 (1) General Information: 
4 

5 (i) APPLICANT: DOUGLAS SMITH 
6 

7 (ii) TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES 

8 RELATING TO HELICOBACTER PYLORI FOR 

9 DIAGNOSTICS AND THERAPEUTICS 
10 

--> 11 (iii) NUMBER OF SEQUENCES: 880

12 

13 (iv) CORRESPONDENCE ADDRESS: 

14 (A) ADDRESSEE: LAHIVE & COCKFIELD 

15 (B) STREET: 60 State Street, Suite 510 

16 (C) CITY: Boston 

17 (D) STATE: Massachusetts 

18 (E) COUNTRY: USA 

19 (F) ZIP: 02109-1875 
20 

21 (v) COMPUTER READABLE FORM:

22 (A) MEDIUM TYPE: Floppy disk 
23 (B) COMPUTER: IBM PC compatible

24 (C) OPERATING SYSTEM: PC-DOS/MS-DOS

25 (D) SOFTWARE: PatentIn Release #1.0, Version #1.25
26 

27 (vi) CURRENT APPLICATION DATA: 

28 (A) APPLICATION NUMBER: US 08/487,032 

29 (B) FILING DATE: 07-JUNE-1995 
30 

31 (viii) ATTORNEY/AGENT INFORMATION:

32 (A) NAME: Mandragouras, Amy E. 

33 (B) REGISTRATION NUMBER: 36,207 

34 (C) REFERENCE/DOCKET NUMBER: GTN-001 
35 

36 (ix) TELECOMMUNICATION INFORMATION: 

37 (A) TELEPHONE: (617)227-7400 

38 (B) TELEFAX: (617)227-5941 
39 




ERRORED SEQUENCES FOLLOW: 



PAGE: 2 RAW SEQUENCE LISTING DATE: 02/23/96 

PATENT APPLICATION US/08/487,032 TIME: 11:34:52

INPUT SET: S8924.raw

5923 (2) INFORMATION FOR SEQ ID NO: 149: 
5924 

5925 (i) SEQUENCE CHARACTERISTICS: 

5926 (A) LENGTH: 1017 base pairs 

5927 (B) TYPE: nucleic acid 

5928 (C) STRANDEDNESS: double

5929 (D) TOPOLOGY: circular

5930 

5931 (ii) MOLECULE TYPE: DNA (genomic) 

5932 

5933 (iii) HYPOTHETICAL: NO 

5934 

5935 (iv) ANTI-SENSE: NO 

5936
5937 (vi) ORIGINAL SOURCE:

5938 (A) ORGANISM: Helicobacter pylori 

5939 
5940 

5941
--> 5942 (xi) SEQUENCE DESCRIPTION: PHOSPHOMANNOMUTASE

5943

5944 ATGATCACTG GCTCTCACAA CCCCAAAGAA TACAACGGCT TTAAAATCAC GCTCAATCAA 60
5945

5946 AACCCGTTTT ATGGCAAGGA CATTCAGGCT TTAAAAAACA CGCTTTTAAA CGCAAAGCAT 120
5947

5948 GAAATAAAGC CCCTAAAAGA AACGCCAGAG AAAGTCAATG CCCTAGAAGC GTATCATCGC 180
5949

5950 TATTTGATCA AGGATTTTAA GCATTTAAAA AATCTTAAAT ACAAAATCGC CCTGGATTTT 240
5951

5952 GGTAATGGCG TGGGGGCGTT AGGATTAGAG CCGATTTTAA AGGCTTTAAA CATTGATTTT 300
5953

5954 AGCAGCCTTT ATAGCGATCC TGATGGGGAT TTTCCTAACC ACCACCCAGA CCCTAGCGAA 360
5955

5956 GCGAAAAACT TAAAAGACTT AGAAAAACAC ATGCGAGAAA ACGCTATTTT AATAGGCTTT 420
5957

5958 GCTTTTGATG GCGATGCGGA TAGGATTGCG ATGCTAAGCT CTCATCATAT CTATGCGGGC 480
5959

5960 GATGAATTAG CGATTTTATT CGCTAAACGC TTGCATGCTC AAGGCATCAC CCCTTTTGTG 540
5961

5962 ATCGGCGAAG TCAAATGCTC TCAAGTGATG TATAACGCAA TCAATACTTT TGGTAAGACG 600
5963

5964 CTCATGTATA AAACCGGGCA TAGCAATTTA AAAATCAAGC TCAAAGAAAC TAATGCGCAT 660
5965

5966 TTTGCGGCTG AAATGAGCGG GCATATCTTT TTTAAAGAAC GCTATTTTGG CTATGATGAC 720
5967

5968 GCTCTTTACG CATGTTTAAG GGCTTTGGAG TTATTGCTTG AACAAAGTCC AAGCGACTTG 780
5969

5970 GAAAACACCA TTAAAAACCT CCCCTATTCC TACACCACGC CTGAAGAAAA AATCGCCGTG 840
5971

5972 AGCGAAGAAG AAAAATTTGA AATCATTCGC AACTTACAAG AAGCGCTTAA AAACCCGCCA 900
5973

5974 AGCCATTTCC CTACAATCAA AGAAATCATC AGCATTGATG GCGTGAGAGT GGTTTTTGAA 960
5975






PAGE: 3 RAW SEQUENCE LISTING DATE: 02/23/96

PATENT APPLICATION US/08/487,032 TIME: 11:34:56

INPUT SET: S8924.raw

5976 CATGGCTTTG GGCTTATTCG CGCAAGCAAC ACCCACCCCC TATTTAGTCA GCCGCTT 1017
5977
5978






(2) INFORMATION FOR SEQ ID NO: 179:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 
GTGGGTGTCT TATCCCTCAA AATAGAGGCA ATTTCTAATT TTTATGGGTT ATGCGTTTTA 
GGGGTGTTGT TAGCATGTTT TTATCTTTTA GACGCTTATT ATCTCATGCA AGAAAGGCTG 
TTTAGGGAGC AATACCAATG GCTAATAAAA AACCGACTTA AAACCGATGA AAGGCTGTTT 
GAAGTCTTCC CTATTCATCA AACTTGCCAA TCAACGCAAT TCTTATCGCC ATGCGTTCGT 



TTAGTCTTTT CCCCTATTGG GCGT 

H A P T (D!E$J Aqll 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



b dvQL _ FH iq acid 




7 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO
(vi) 0 NiOTAAAGCGGTC ^ 240 
TTTTTCACCC ACCATACTTT AAAGGCTTCG TTTGAGCCGA CTAACCACAT CAATTATAGA 300 
GGGCATGACT ATGTGTTGGA TAATGTGCAT TTCCACGCCC CTATGGAGTT TTTAATCAAT 360 
AATAAAACCA GGCCTTTGAG CGCGCATTTC GTGCATAAAG ACGCTAAAGG GCGTTTGTTG 420

GTGTTAGCGA TTGGTTTTGA AGAAGGGAAA GAAAACCCCA ACCTTGATCC TATTTTAGAA 480 
GGCATTCAAA AGAAACAAAA TCTTAAAGAG GTGGCTTTAG ACGCTTTCTT GCCTAAAAGC 540 



RAW SEQUENCE LISTING DATE: 02/23/96

PATENT APPLICATION US/08/487,032 TIME: 11:35:00

INPUT SET: S8924.raw

ATCAATTACT ACCATTTTAA CGGCTCTCTC ACCGCTCCTC CTTGCACAGA GGGGGTGGCA 600 
TGGTTTGTCA TAGAAGAACC TTTGGAAGTT TCTGCCAAAC AATTGGCTGA AATCAAAAAA 660 
CGCATGAAAA ATTCGCCCAA CCAACGCCCC GTCCAGCCTG ACTACAACAC CGTGATCATT 720 
AAAAGCTCGG CTGAGACCCQ C 741 






(2) INFORMATION FOR SEQ ID NO: 131:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1266 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

ATGAAAATTT CTTTATTGGG GCATGGAAAA ACCACTCTAG CCCTAGGGCG TTTTTTTAAA 60 

AAAAACCATA ATGAAGTCAA ATTTTTTGAT GATAAATTCC CTGCATTTTT TAAGGATAGC 120 

GAGGGTTTTC TTTGCTACCC TAGTAAGGAT TTTAACCCTA ATGATTCCCA ACTAGAAATC 180 

GTCAGCCCTG GCATTAGTTT CACGCACCCT TTAGTCATGA AAGCCAAGCA TTTAATGAGC 240 

GAATACGATT ATATTGATAG TTTGTTTGAT CATTCTTTCA CGCCTACGAT GATAAGTATT 300 

AGCGGCACTA ACGGGAAAAC CACCACGACC GAAATGCTCA CCACACTTTT AGAAGATTTT 360 

AAGGCTGTGA GTGGGGGGAA TATCGGCACG CCCTTGATTG AATTGTTTGA AAAACGATCG 420 

CCCTTGTGGG TGCTAGAAAC AAGCTCCTTT TCTTTGCATT ACACTAATAA GGCTTACCCT 480 

TTAATCTACT TGCTCATCAA TGTGGAAGCC GATCATTTGA CTTGGCATTG CAATTTTGAA 540 

AATTATTTGA ACGCTAAACT CAAGGTTTTA ACATTGATGC CTAAAACTTC GCTCGCTATC 600 

CTCCCTTTAA AATTCAAAGA ACACCCTATT GTTCAAAACT CGCAAGCGCA AAAAATCTTT 660 

TTTGACAAAA GCGAAGAGGT TTTAGAGTGT TTAAAAATCC CTTCTAACGC CCTTTTTTTT 720 



RAW SEQUENCE LISTING DATE: 02/23/96

PATENT APPLICATION US/08/487,032 TIME: 11:35:04

INPUT SET: S8924.raw



AAGGGAGCGT TTTTATTAGA CGCGGCTTTA GCCCTTTTAG TTTATGAGCA ATTTTTAAAA 780

ATAAAGAATT TAAAATGGCA AGATTATAGA GAAAACGCCC TTAAAAGACT GAACGCTTTT 840

AAAATCGGCT CGCATAAAAT GGAAGAATTT AGGGATAAAC AAGGGCGTTT GTGGGTAGAT 900

GACAGCAAAG CCACGAATAT TGATGCCACC TTACAAGCCC TAAAAACCTT TAAAAACCAA 960

nnnf*n AT ATT

Ail lAAUCCL'

1 1 i 1 1 uAA

GAGTTTAAAA ACTATAAAAT AAGCCTTTAT GCCATAGGAT CAAGCGCTTC TATCATACAA 1080

GCCTTAGCGT TAGAATTTAA TGTTTCTTGT CAGGTTTGTT TGAAGTTAGA AAAAGCGGTT 1140

CAAGAAATTA AAAGCGTTTT ATTACAAAAT GAAGTCGCTT TGCTTTCACC TAGCGCGGCC 1200

AGTTTGGATC AATTTTCTTC GTATAAAGAA AGGGGTGAAA AATTCAAAGC GTTTGTTTTA 1260

AAAGAT 1266






(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(xi) SEQUENCE DESCRIPTION: PHOSPHOMANNOMUTASE




ATGATCACTG GCTCTCACAA CCCCAAAGAA TACAACGGCT TTAAAATCAC GCTCAATCAA 60

AACCCGTTTT ATGGCAAGGA CATTCAGGCT TTAAAAAACA CGCTTTTAAA CGCAAAGCAT 120

GAAATAAAGC CCCTAAAAGA AACGCCAGAG AAAGTCAATG CCCTAGAAGC GTATCATCGC 180

TATTTGATCA AGGATTTTAA GCATTTAAAA AATCTTAAAT ACAAAATCGC CCTGGATTTT 240

GGTAATGGCG TGGGGGCGTT AGGATTAGAG CCGATTTTAA AGGCTTTAAA CATTGATTTT 300



RAW SEQUENCE LISTING DATE: 02/23/96

PATENT APPLICATION US/08/487,032 TIME: 11:35:08

INPUT SET: S8924.raw

AGCAGCCTTT ATAGCGATCC TGATGGGGAT TTTCCTAACC ACCACCCAGA CCCTAGCGAA 360 

GCGAAAAACT TAAAAGACTT AGAAAAACAC ATGCGAGAAA ACGCTATTTT AATAGGCTTT 420 

GCTTTTGATG GCGATGCGGA TAGGATTGCG ATGCTAAGCT CTCATCATAT CTATGCGGGC 480 

GATGAATTAG CGATTTTATT CGCTAAACGC TTGCATGCTC AAGGCATCAC CCCTTTTGTG 540 

ATCGGCGAAG TCAAATGCTC TCAAGTGATG TATAACGCAA TCAATACTTT TGGTAAGACG 600 

CTCATGTATA AAACCGGGCA TAGCAATTTA AAAATCAAGC TCAAAGAAAC TAATGCGCAT 660 

TTTGCGGCTG AAATGAGCGG GCATATCTTT TTTAAAGAAC GCTATTTTGG CTATGATGAC 720 

GCTCTTTACG CATGTTTAAG GGCTTTGGAG TTATTGCTTG AACAAAGTCC AAGCGACTTG 780 

GAAAACACCA TTAAAAACCT CCCCTATTCC TACACCACGC CTGAAGAAAA AATCGCCGTG 840 

AGCGAAGAAG AAAAATTTGA AATCATTCGC AACTTACAAG AAGCGCTTAA AAACCCGCCA 900 

AGCCATTTCC CTACAATCAA AGAAATCATC AGCATTGATG GCGTGAGAGT GGTTTTTGAA 960 

CATGGCTTTG GGCTTATTCG CGCAAGCAAC ACCCACCCCC TATTTAGTCA GCCGCTT 1017 







(2) INFORMATION FOR SEQ ID NO: 162

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 base pairs 

(A) ORGANISM: Helicobacter pylori





(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..300

(D) OTHER INFORMATION: /note= "UDP-N-ACETYLMURAMYL-TRIPEPTIDE
SYNTHETASE"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: [illegible]

ATGGGAGCGA TAGCGAGTTG TTACGCGCAT CAAATCATCT TAACTTCAGA CAATCCTAGA 60 

AGCGAAAACG AAGAAGACAT CATTAAGGAT ATTTTAAAAG GCATCAATAA TTCTTCTAAA 120 

GTCATTGTAG AAAAAGACCG AAAAAAGGCC ATTTTAAACG CTTTAGAAAA TTTAAAAGAC 180 

GATGAGGTGT TGTTGATTTT AGGCAAGGGC GATGAAAACA TTCAAATCTT TAAAGACAAA 240 

ACGATTTTTT TTAGCGACCA GGAAGTCGTT AAAGATTATT ATCTCAATTT AAAACAAGGA 300 



RAW SEQUENCE LISTING DATE: 02/23/96

PATENT APPLICATION US/08/487,032 TIME: 11:35:13

INPUT SET: S8924.raw






(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..240

(D) OTHER INFORMATION: /note= "FLAGELLAR MOTOR SWITCH PROTEIN F" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 
GTGATGGACA AACTCACTAA AAGCTTGCAA ACGCAAAAAA ACTTCGCTTA TTTAGGCAAA 60 
ATCAAGCCCC AACAACTCGC TGATTTCATC ATTAACGAAC ACCCTCAAAC CATCGCCTTG 120 
ATTTTGGCCC ACATGGAARC CCCTAATGCG GCTGAAACTT TGAGCTATTT CCCTGATGAA 180 
ATGAAAGCCG AGATTTCCAT TAGAATGGCG AATTTTAGGC GAAATATCGC CCCAAGTGGT 240 






(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1263 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224:



RAW SEQUENCE LISTING DATE: 02/23/96

PATENT APPLICATION US/08/487,032 TIME: 11:35:17

INPUT SET: S8924.raw



ATGAAAGGTT TAACAATGAA AAAATTAGTT TTTAGCATGC TTTTATGTTG TAAAAGCGTG 60 

TTTGCAGAGG GGGAAACTCC TTTGATTGTC AATGACCCAG AAACCCATGT AAGTCAAGCC 120 

ACTATCATAG GCAAAATGGT AGATAGTATC AAAAGATACG AAGAGATTAT TTCTAAGGCT 180 

CAAGCTCAAG TCAATCAGTT ACAAAAAGTC AATAACATGA TAAATACGAC TAATTCTTTG 240 

ATTAQTAGTA GTGCTATCAC TTTAGCGAAT CCTATGCAAG TTTTACAAAA CGCTCAGTAT 300 

CAAATAGAGA GCATTAGATA CAACTATGAG AATTTAAAGC AAAGCATAGA AAATTGGAAC 360 

GCACAAAATT TGTTAAGAAA CAAATACTTA CAGCAACAAT GCCCTTGGCT TAATGTCAAT 420 

GCTCTTACTA ACAATAAGAT TGTCAATCTT AAAGATCTCA ATAACCTAAT CACCAAAAAT 480 

GGCGAACAAA CCCAAACCGC AAGAGATGTG CAAAATCTCA TTCAGTCCAT TAGTGGCAGT 540 

GGCTATGGAA ACATGCAATC ACTTGCTGGG GAATTGAGTG GTAGAGCGTG GGGGGAAATG 600 

TTGTGTAAAA TGGTAAACGA TAGTAATTAT GAAAGCGAGC AAGCTCTTTT AGCAACAGGC 660 

AATAACCCAG AAGAGCAAAA ACGAAGATTT TTGCTTAGAG TAAAGAAAAA GGTTAATGAT 720 

AATAAGCAGT TAAAAGATAA ACTTGACCCA TTTCTAAAAA GACTTGATGT CCTACAAACT 780 

GAGTTTGGTG TAACTGACCC TACAGCTAAC CATAATAAGC AAGGGATACA TTATTGCACA 840 

GAAAATAAAG AGACAGGTAA ATGCGACCCT ATTAAAAATG TATTTAGGAC AACTCGCTTA 900 

GATAACGAAT TAGAACAAGA AATCCAAACG CTCACACTTG ATTTAATCAA AGCCTCCAAT 960 

AAAGACGCTC AAAGCCAAGC CTACGCAAAT TTCAATCAAA GGATTAAATT ACTTACTCTA 1020 

AAATATTTAA AAGAAATTAC CAATCAAATG CTCTTTTTAA ATCAAACAAT GGCAATGCAA 1080 

AGCGAGATTA TGACAGATGA TTATTTTAGG CAAAATAATG ATGGCTTTGG GGAAAAAGAA 1140 

AACCATATAG ACGAACAATT AACGCAAAAA AGAATAAACG AAAGAGAAAG AGCTAGAATA 1200 

TACTTTCAAA ACCCTAATGT TAAATTTGAC CAATTTGGCT TTCCCATTTT TAGTATATGG 1260 

GAT 1263 






(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2$bL A 
%% A8 D D A i ! "( A 

! P !P +h BO 8 d 



B 6 { F 







PAGE: 9 RAW SEQUENCE LISTING DATE: 02/23/96 

PATENT APPLICATION US/08/487,032 TIME: 11:35:21 

INPUT SET: S8924.raw

— > 9107 4 D 0, !$ D ( Q! ( " 






Notice of Availability 



Applicant Aid for Biotechnology Computer Readable Form (CRF) 

Sequence Listings Submissions 

The Patent and Trademark Office (PTO) has developed a computer 
program, called Checker, that will aid applicants in identifying 
and correcting errors prior to making submissions for compliance 
with the Requirements for Patent Applications Containing 
Nucleotide Sequence and/or Amino Acid Sequence Disclosures 
(sequence rules: 37 CFR 1.821 through 1.825). (Final rules were
published in the Federal Register (55 FR 18230) on May 1, 1990,
and in the PTO Official Gazette (1114 Off. Gaz. Pat. Office 29) on
May 15, 1990.)

Checker is a DOS-based software program that is intended to 
assist users in determining whether errors may be present in the 
sequence listings, and is not intended to guarantee that the 
submission is error-free. 

The most current version of the software will be available via 
computer downloading (details below). Copies on diskette are
also available. Updated software versions will not be 
automatically mailed out; any updates will be announced in the 
PTO Official Gazette. 

The software can be accessed/requested in the following 
locations: 

1) Dial-up access to the Patent and Trademark Office Bulletin 
Board System. 

Phone number: 703-305-8950 
Cost: Free of charge

2) Dial-up access through the Internet. FTP site: ftp.uspto.gov 
Login as "anonymous". Software is in directory /pub/checker 
Cost: Free of charge

3) For diskette copies, telephone requests to 703-308-0322. 
Cost: $25.00 



For Further Information Contact: Meredith Beckhardt at 703-308-4212. 




CRF Diskette Problem Report 






The Scientific and Technical Information Center (STIC) experienced 
a problem when processing the following CRF diskette: 



Application Serial Number: 
Filing Date: 



Classification: 



Date Reviewed by STIC: 

Point-of-Contact / Telephone No: Meredith Beckhardt 

Nature of Problem:



The CRF diskette was: 
□ Damaged

□ Unreadable [handwritten annotation illegible]

□ Blank (no files present on the floppy disk)

□ A computer virus was detected on the diskette. The STIC will not
process the diskette through the Data Capture System.

Name of the virus:



□ The CRF diskette contains an error that disrupts normal processing, as
explained below:

□ The Sequence Listing was not converted into ASCII (DOS) text
□ See attached pages for clarification -->
□ Other:



~0 9/7/95 



